An Implicit Shape Model for Combined Object Categorization and Segmentation
نویسندگان
چکیده
We present a method for object categorization in real-world scenes. Following a common consensus in the field, we do not assume that a figure-ground segmentation is available prior to recognition. However, in contrast to most standard approaches for object class recognition, our approach automatically segments the object as a result of the categorization. This combination of recognition and segmentation into one process is made possible by our use of an Implicit Shape Model, which integrates both capabilities into a common probabilistic framework. This model can be thought of as a non-parametric approach which can easily handle configurations of large numbers of object parts. In addition to the recognition and segmentation result, it also generates a per-pixel confidence measure specifying the area that supports a hypothesis and how much it can be trusted. We use this confidence to derive a natural extension of the approach to handle multiple objects in a scene and resolve ambiguities between overlapping hypotheses with an MDL-based criterion. In addition, we present an extensive evaluation of our method on a standard dataset for car detection and compare its performance to existing methods from the literature. Our results show that the proposed method outperforms previously published methods while needing one order of magnitude less training examples. Finally, we present results for articulated objects, which show that the proposed method can categorize and segment unfamiliar objects in different articulations and with widely varying texture patterns, even under significant partial occlusion.
منابع مشابه
Combined Object Categorization and Segmentation with an Implicit Shape Model
We present a method for object categorization in real-world scenes. Following a common consensus in the field, we do not assume that a figureground segmentation is available prior to recognition. However, in contrast to most standard approaches for object class recognition, our approach automatically segments the object as a result of the categorization. This combination of recognition and segm...
متن کاملCognitive Vision Systems CogVis
We present a method for object categorization in real-world scenes. Following a common consensus in the field, we do not assume that a figureground segmentation is available prior to recognition. However, in contrast to most standard approaches for object class recognition, our approach automatically segments the object as a result of the categorization. This combination of recognition and segm...
متن کاملShape-from-recognition: Recognition enables meta-data transfer
Please cite this article in press as: A. Thomas e (2009), doi:10.1016/j.cviu.2009.03.010 Low-level cues in an image not only allow to infer higher-level information like the presence of an object, but the inverse is also true. Category-level object recognition has now reached a level of maturity and accuracy that allows to successfully feed back its output to other processes. This is what we re...
متن کاملمدلسازی تاثیرات پسروی دریاچه ارومیه بر روستاهای ساحل شرقی دریاچه ارومیه با پردازش شیءگرای تصاویر ماهوارهای
Urmia Lake is one of the largest hyper saline lakes in the world and largest inland lake in Iran which located in the north west of Iran, between the provinces of East Azerbaijan and West Azerbaijan. The lake basin is one of the most influential and valuable aquatic ecosystems in the country and registered as UNESCO Biosphere Reserve. In addition, it is very important in terms of water resource...
متن کاملLatent mixture vocabularies for object categorization and segmentation
The visual vocabulary is an intermediate level representation which has been proved to be very powerful for addressing object categorization problems. It is generally built by vector quantizing a set of local image descriptors, independently of the object model used for categorizing images. We propose here to embed the visual vocabulary creation within the object model construction, allowing to...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006